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13  ABSTRACT 


A  method  of  analysis  of  transient  wav  eforms  is  discussed.  Its  properties 
and  limitations  are  presented  in  the  context  of  musical  tones.  The  method 
is  shown  to  be  useful  when  the  risetimes  of  the  partials  of  the  tone  are 
not  too  short.  An  extention  to  inharmonic  partials  and  polyphonic  musical 
sound  is  discussed. 
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INTRODUCTION 


The  analysis  of  the  attack  transients  of  vocal  or  musical 
tones  joes  back  as  far  as  1932  with  Backhaus*  11.21  tunable  resonator 

and  drum  recorder.  Luce  and  Clark  [3.4,51  used  filtering  methods  to 

select  partial  tones  for  analysis  and  recording,  flore  recently,  with 
the  advent  of  computer  music,  analysis  of  musical  instruments  for  the 
purpose  of  simulation  of  timbre  has  been  done  by  what  will  be  called 
a  "hetrodyne  filter11  for  want  of  a  better  name.  Beauchamp  CGI 
analysed  each  partial  of  a  complex  waveform  by  first  multiplying  the 
waveform  by  a  sin  and  cosine  at  the  frequency  of  the  partial  in 
question.  The  result  was  then  low-pass  filtered,  then  squared  and 
summed  Freedman  17,8.91.  and  later  Keeler  110,111  used  a  discrete 
finite  summation  over  one  period  of  the  fundamental  frequency  in 

place  of  Beauchamp’s  low-pass  filter,  an  effect  which  as  we  shall 

show  later  conveniently  places  a  zero  of  transmission  at  all  harmonic 

partials  other  than  the  one  in  question. 

It  is  the  purpose  of  this  article  to  explore  this  method, 

report  its  characteristics,  its  limitations,  its  uses  and  some  simple 
ex  tens i ons. 


2 


List  of  F i gures 


Figure  1  -  Frequency  response  of  a  hetrodyne  filter  with  center 

frequency  of  400  Hz  and  summation  period  of  10  milliseconds. 

Figure  2  -  Frequency  response  of  a  hetrodyne  filter  uith  center 
frequency  of  100  Hz  and  summation  period  of  10  milliseconds. 

Figure  3  -  Frequency  response  of  a  hetrodyre  filter  uith  center 
frequency  of  300  Hz  and  summation  period  of  10  milliseconds  uith  the 
signal  beginning  halfuay  through  the  summation. 

Figure  4  -  Hetrodyne  filter  applied  to  a  signal  uhich  is  a  sum  of 
sinusoids  of  frequencies  100,  200,  and  300  Hz,  uith  exponential  attacks 
of  time  constants  30,  20  and  10  milliseconds  respectively.  The  graphs 
are  of  the  outputs  of  a  hetrodyne  filter  uith  summation  period  of 
10  milliseconds.  The  center  frequencies  are,  top  to  bottom,  100,  200 
and  300  Hz. 

Figure  5  -  Same  data  as  in  figure  4,  but  the  attacks  are  linear  rather 
than  exponential. 

Figure  S  -  The  magnitude  and  phase  of  the  output  of  the  hetrodyne 
filter  uhen  applied  to  a  132  Hz  guitar  tone.  The  apparent  modulation 
is  the  result  of  beating  uith  an  inharmonic  partial  at  18G  Hz. 

Figure  7  -  Frequency  response  of  a  hetrodyne  filter  uhen  the  center 
frequency  is  not  exactly  an  integral  multiple  of  the  summation 
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frequency. 


THE  HETHOO 


Let  us  define  the  hetrodyne  filter  as  follous.  Ue  begin  uith 
a  discrete  function  Fi  which  represents  a  continuous  function  F (t) 
at  discrete  intervals  t-ih,  where  h  is  the  time  between  samples,  h 
is  called  the  "sampling  interval."  The  reciprocal  of  h  is  called  the 
"sampling  frequency"  or  the  "sampling  rate."  Let  us  define  a  and  b 

as  fol lows: 


ofW-1  . 

a  =  Fi  cos(ujoih  +  0O) 


crt-N-1 


i-« 


Fi  sin(moih  +  0o) 


(1) 


(j,c  will  be  called  the  "center  frequency". 

Without  loss  of  generality,  we  may  define 


=  Go  +  ah 


(2) 


and  thus  reurite  the  sum  as  going  from  0  to  N-l. 
N-l 

a  =  21  Fi  cos(u)oih  +  ^o) 
i-0 


b  =  ^  Fi  sin(u)oih  +  ®o) 
i=0 


y 


This  change  of  variables  hides  the  time-position  of  the 
filter  in  the  phase.  Ue  must  remember  that  as  the  filter  is  advanced 
through  time,  the  phase  angle  will  increase,  and  that  any  results 
which  depend  on  this  phase  angle  will  be  functions  of  time. 

Since  the  summation  operation  is  linear,  we  may  represent  the 
input  waveform  Fi  as  a  sum  of  sinusiods  and  may  thus  examine  the 
response  of  the  filter  to  a  sinusiodal  excitation,  as  is  commonly 
done  with  linear  filters. 

N-l 

a  =  21  A  cos(uri.h  +  ^o) 
i-0 

(4) 

N-l 

b  =  £  A  cos(ulh  +  co)  sin(uDih  +  cdo) 
i=0 


With  the  help  of  the  summation  calculus  [12]  and  some  trigonometric 
identities,  we  may  compute  the  summation  in  closed  form  without 
error  as  fol lows: 


s inf  ( orf-mo )Nh/2l  cosf  (g^-u»)fN-l)h/2-KP  +  (0o' 
sinf  (u)-a)o)h/2] 


„o)Nh/2l  cos] 


f N-l^h/2  +cn  -  CT01 


8in[  (<o-a)o)h/2]  J  (5) 

A_  |  sln|  (frtg)o]N'h/2l  sinf  (iu+w)  (N-l)n/2  +CP+ 
b  “  2N  sinf  (oj+o)o)h/2] 

sinf  (m-g)o)Nh/2l  sinf ( ov  tro)  (N-l )h/2  +  ^ 

sinf  (a)-mo)h/2  J 

This  is  not  a  very  useful  expression  as  it  stands,  but  it  may  be 
simplified  somewhat  by  computing  the  sum  of  the  squares  of  a  and  b. 

2..  2  A  f  sin2[  ((ij+(j5o)Nh/2]  ,  sin2f  (n'-„o)Nh/2] 


l  sin^f  (u^‘(B°)h/2]  sin^f  ( (o-n'Ojh/? 


§*rt  s  i  1  i  *h  *>* 


I 
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Thus  we  see  that  in  the  limit,  the  magnitude  of  the  output  of  the 
filter  becomes  independant  of  time  and  becomes  a  measure  of  the 
amplitude  of  the  input  sinusoid.  The  phase  of  the  output  of  the 
filter  remains  always  a  function  of  time,  but  is  also  a  linear 
function  of  time,  its  slope  being  determined  by  the  difference  of  the 
center  frequency  and  the  input  frequency. 


I 

I 

I 

! 

I 
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This  reveals  a  method  of  determining  the  amplitude  of  the 
input  sinusiod  and  getting  a  better  estimate  of  its  frequency.  If  the 
center  frequency  of  the  summation  is  near  the  actual  frequency  of  the 
input  sinusoid,  the  phase  will  be  very  nearly  a  linear  function  of 
time,  thus  ue  may  find  the  frequency  deviation  by  fitting  the  phase 
with  a  straight  line  and  observing  its  slope. 

The  consequences  of  choosing  N  as  above  are  significant.  If 
the  center  frequency  is  a  multiple  of  some  fundamental  frequency, 
then  ue  may  choose  N  to  coinc'de  uith  the  period  of  the  fundamental 
and  thus  cancel  out  all  the  harmonic  partials  except  the  center 
frequency.  Figure  1  shows  the  log  of  the  magnitude  of  the  output 
of  the  hetrodyne  filter  for  a  range  of  sinusoidal  inputs.  The 
center  frequency  in  this  plot  is  400  Hz  and  the  summation  period,  Nh, 
is  10  milliseconds.  Figure  2  shows  the  log  of  the  magnitude  versus 
frequency  for  a  center  frequency  of  100  Hz  and  the  same  summation 
period.  Notice  the  zeros  of  transmission  at  all  multiples  of  the 
summation  period  except  the  center  frequency. 


ERROR  ANALYSIS 


F-eedman  191  and  Keeler  [11  j  both  shou  that  this  method  is 
sufficiently  accurate  for  their  purposes  even  uhen  the  input  signal 
is  not  a  perfect  sum  of  sinusoids.  Keeler  does  not  even  bother 
computing  the  summation  at  each  point,  but  at  regular  intervals  only, 
and  presents  us  with  an  e'egant  proof  that  the  error  in  doing  so  is 
negl igable.  The  above  ucrk  may  well  seduce  one  as  it  did  the  author 
into  believing  that  this  is  a  perfectly  accurate  method,  universally 
applicable.  This  is  not  so.  To  persue  the  matter  further,  let  us 
reformulate  the  equations  somewhat.  He  shall  compute  not  two 
summations  but  one: 


G 

a 


ioph+So 

2_  V 

i=a 


(9) 


The  result  will  be  a  complex  quantity  whose  real  and  imaginary  parts 
correspond  to  the  a  and  b  discussed  earlier.  Ue  will  be  interested 
in  the  magnitude  and  the  phase  of  G.  For  the  input  uaveform,  Fi,  we 
shall  take  a  complex  sinusoid  uith  exponential  decay. 

_  ^  e(8+jaj)ih+eejnflih+Oo 


cvhT  9+  j  (  u'+ojo  )  ]  +  j  ( o ) 
e 


Nhr  a+j(UD+UDo)]  . 

e _  -i- 

hi  a+j(ojf-(J,o)  ]  -  1 

e 


(10) 


Again,  we  see  the  magnitude  is  related  to  the  amplitude  of 
the  input  sinusoid  and  the  phase  drift  with  time  is  related  to  the 
frequency  differece.  The  exponent ial  decay  of  the  input  signal 
causes  imperfect  cancellation  of  other  harmonic  partials,  and 
depending  on  the  speed  of  the  attack,  the  deviation  can  be 

i mpor tant. 

A  more  revealing  case  would  be  to  assume  the  signal  begins 
at  zero  amplitude,  and  rises  exponentially  to  its  steady-state  value 
and  that  the  signal  begins  somewhere  during  the  summation,  say  at 

i-0. 

Ca.e-x"'  (l-e,lh)  >oilrreo 

e(N-p)  jh(tiD+uuo).  L 

«  (o^B)jh(u^wo)+j(e^°)  jhVuj+uA)  T 

e  "  (11) 

■  0  0 _  _  .  -  _ 

h[  a+j(uj+ujo)]  77 

e  L 


This  is  equivalent  to  setting  N  to  some  smaller  value.  As  the 
filter  progresses  through  the  attack,  >he  effective  uidth  of  tho 
window  will  approach  N,  and  the  response  of  the  filter  will  become 
more  representative. 


// 


Here  ue  see  that  even  if  N  and  the  center  frequency  are 
carefully  chosen,  the  1-equency  response  is  not  the  same.  Figure  3 
shous  the  response  for  a  center  frequency  of  300  Hz,  a  summation 
interval  of  10  milliseconds,  and  j3«N/2.  Ue  see  that  the  neighboring 
harmonics  are  not  canceled  out.  This  shows  that  if  the  signal  begins 
anywhere  within  the  sindow.  the  output  should  not  be  taken  to  be  an 
accurate  indication  of  the  amplitude  of  the  partial.  It  is  exactly 
ana  I  agous  to  taking  N  to  be  a  non-multiple  of  the  period  of  the 
fundamental  frequency.  Figure  4  shows  the  output  for  an  averaging 
window  of  10  milliseconds  and  center  frequencies  of  100,  200,  and 
300  Hz.  The  input  was  a  sum  of  sinusoids  of  unit  amplitude  and 
frequencies  100.  200,  and  300  Hz  with  exponential  attacks  of  time 

constants  30.  20,  and  10  m i  1 1  i seconds  respectively.  The  leakage 

among  the  harmonics  is  apparent  here. 

If  the  attack  is  not  exponential,  another  form  of  distortion 
can  occur.  Figure  5  shows  the  magnitude  of  the  filter  output  for  the 
same  input  and  center  frequencies  as  figure  4,  but  with  linear 

attacks  rather  than  exponential. 

The  presence  of  inharmonic  partials  can  cause  an  effect 
similar  to  amplitude  modulation.  Figure  6  shous  the  magnitude  and 
phase  of  the  fundamental  of  a  guitar  note  at  132  Hz.  The  apparent 
modulation  is  caused  by  an  inharmonic  partial  at  186  Hz,  the 
frequency  of  a  known  box  resonance. 

Another  source  of  error  is  that  of  frequency  quantization.  N 
can  not  in  general  be  chosen  such  that  Nh  is  exactly  the  period  of 
the  fundamental  frequency  and  still  have  N  be  an  integer.  This  can  be 
tolerated,  but  it  also  implies  that  the  center  frequency  must  be  a 
multiple  of  2n/Nh  rather  than  a  multiple  of  the  fundamental 
I  f  we  do  not  set  the  center  frequency  to  exactly  a 


frequency. 


multiple  of  2n/Nh,  ue  get  imperfect  cancellation  of  a  pole  and  a 
zero.  Figure  7  shous  the  magnitude  versus  frequency  of  such  a  case. 
Note  the  doublet  around  400  Hz,  the  center  frequency. 

Since  Nh  is  not  exactly  the  period  of  the  fundamental 
frequency,  the  harmonic  partials  are  not  exactly  cancel  led  out. 
Furthermore,  most  string  instruments  show  a  deviation  from  perfect 
integral  multiples  of  the  fundamental  frequency.  This  deviation  also 
contributes  to  leakage  among  the  harmonics.  Equation  10  may  be  used 
to  determine  exactly  how  much  leakage  is  present. 


A  SIMPLE  EXTENSION 


Although  there  would  seem  to  be  no  good  solution  to  the 
problem  of  the  note  beginning  within  the  summation  period,  there  is 
a  technique  for  dealing  with  the  inharmonic  partials.  If  the  attack 
time  of  the  partial  in  question  is  not  too  swift,  it  can  be  filtered 
out  before  analysis  of  the  harmonic  partials  is  done.  Tho  harmonic 
partials  may  then  be  filtered  out  to  allow  analysis  of  the 
inharmonic  partials. 

The  filter  advocated  here  is  a  comb  filter.  This  is 
described  simply  by  the  recurrence  relation: 


Y  =  X  -  X 


n-m 


(12) 


With  frequency  response 


|  Hm(uo)  I  =  \/sin?(u#nh)  +  [  cos(ucP»h)-ll  ‘ 


(13) 


Ue  see  that  the  combfilter  has  a  zero  of  transmission  at  all 
mu'tiples  of  the  base  frequency  1/mh.  The  only  hazards  with  the  comb 
filter  are  those  of  transient  response  and  preturbing  the  harmonic 
partials.  The  transient  response  of  a  comb  filter  is  explicit.  It  is 
identically  zero  beyond  mil  seconds.  If  this  can  be  tolerated,  then 
the  f i I  ter  may  be  useful . 

If  the  frequencyy  of  some  harmonic  partial  falls  near  one  of 


the  zeros  of  transmission  of  the  comb,  it  will  be  attenuated.  Ue  may 
prevent  this  by  a  method  which  appeared  in  Gold  and  Rader  [13,14]. 
At  those  zeros  of  the  comb  that  we  wish  to  eliminate,  a  digital 
resonater  is  used  to  cancel  out  the  zero.  The  details  of  the 
configuration  are  described  in  full  detail  in  the  references  and 
will  not  oe  repeated  here. 

For  analysis  of  an  inharmonic  partial,  the  harmonic  partials 
may  all  be  eliminated  with  a  single  comb,  subject  to  the  limitation 
that  the  partials  may  not  be  exact  multiples  of  the  fundamental,  and 
that  the  frequency  of  the  fundamental  becomes  quantized  when  the 
comb  length,  m,  is  chosen. 

The  comb  will,  o‘  course,  attenuate  the  signal  under 
analysis  by  some  amount.  Ue  may  predict  the  attenuation  from 
equation  13  and  then  multiply  the  results  of  the  hetrodyne  analysis 
by  the  reciprocal  of  the  attenuation  factor  to  obtain  a  better 
estimate  of  the  amplitudes  of  the  partials. 


CURRENT  USES 


The  technique  o‘  robbing  out  unwanted  signals  and  then 
ana  lysing  the  remaining  waveform  is  the  basis  for  an  automated 
system  for  the  analysis  of  polyphonic  music  currently  being 
developed  bu  the  author.  Uhe  i  two  or  more  instruments  are  playing 
simultaneously,  a  Fourier  analysis  is  used  to  get  an  estimate  of  the 
pitch  and  duration  of  each  note.  All  notes  but  one  are  then 

eliminated  by  combing,  jr  d  the  hetrodyne  filter  is  applied  to 

determine  the  attack  time  or  the  note  and  to  correct  the  estimated 
'  tch  of  the  note.  This  is  iterated  for  each  of  the  notes  in  the 
piece.  The  result  is  subjected  to  a  heuristic  analysis  and  is 

eventually  displayed  as  a  musical  score  of  the  piece  under  analysis. 

Also  under  study  by  ?  colleague  is  how  the  physical 
parameters  of  musical  tones  such  as  risetimes  of  the  partials, 

tremelo,  and  steady-state  value  contribute  to  the  perceived  timbre 
of  the  instrument.  The  hetrodyne  method  as  outlined  above  is  U9ea  to 
determine  these  parameters  from  digitized  recordings  of  tones  of 
actual  musical  instruments. 


CONCLUSION 


The  hetrodyne  filter  has  ueen  shown  to  be  a  useful  method 
for  the  analysis  of  the  partials  of  musical  tones  as  long  as  its 
limitations  are  observed.  It  can  fail  if  the  attack  times  are  too 
quick,  if  the  frequencies  of  the  partials  deviate  too  far  from 
perfect  integral  multiples  of  the  fundamental  frequency,  or  if  the 
sampling  rate  ii  so  low  that  frequency  quantization  effects  become 
significant.  I t  can  also  fail  if  substantial  frequency  modulation 
(vibrato)  is  present. 

An  extention  of  the  method  to  trrics  with  inharmonic  partials 
and  even  to  multiple  simultaneous  notes  was  shown  to  be  possible  by 
use  of  the  comb  filter  as  long  as  the  effects  of  the  transient 
response  of  the  comb  was  judged  to  be  tolerable. 

These  methods  are  currently  in  use  by  the  author  and  his 
colleagues  in  the  analysis  of  digitized  musical  sound. 
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